Regression-tree Tuning in a Streaming Setting
نویسندگان
چکیده
We consider the problem of maintaining the data-structures of a partition-based regression procedure in a setting where the training data arrives sequentially over time. We prove that it is possible to maintain such a structure in time O (log n) at any time step nwhile achieving a nearly-optimal regression rate of Õ ( n−2/(2+d) ) in terms of the unknown metric dimension d. Finally we prove a new regression lower-bound which is independent of a given data size, and hence is more appropriate for the streaming setting.
منابع مشابه
An Improvement over Random Early Detection Algorithm: A Self-Tuning Approach
Random Early Detection (RED) is one of the most commonly used Active Queue Management (AQM) algorithms that is recommended by IETF for deployment in the network. Although RED provides low average queuing delay and high throughput at the same time, but effectiveness of RED is highly sensitive to the RED parameters setting. As network condition varies largely, setting RED's parameters with fixed ...
متن کاملConfidence Decision Trees via Online and Active Learning for Streaming Data
Decision tree classifiers are a widely used tool in data stream mining. The use of confidence intervals to estimate the gain associated with each split leads to very effective methods, like the popular Hoeffding tree algorithm. From a statistical viewpoint, the analysis of decision tree classifiers in a streaming setting requires knowing when enough new information has been collected to justify...
متن کاملAn ANOVA Based Analytical Dynamic Matrix Controller Tuning Procedure for FOPDT Models
Dynamic Matrix Control (DMC) is a widely used model predictive controller (MPC) in industrial plants. The successful implementation of DMC in practical applications requires a proper tuning of the controller. The available tuning procedures are mainly based on experience and empirical results. This paper develops an analytical tool for DMC tuning. It is based on the application of Analysis of V...
متن کاملMachine learning in Dynamic Adaptive Streaming over HTTP (DASH)
Koffka Khan Department of Computing and Information Technology The University of the West Indies, Trinidad and Tobago, W.I Email: [email protected] Wayne Goodridge Department of Computing and Information Technology The University of the West Indies, Trinidad and Tobago, W.I Email: [email protected] -------------------------------------------------------------------ABSTRACT----...
متن کاملConfidence Decision Trees via Online and Active Learning for Streaming (BIG) Data
Decision tree classifiers are a widely used tool in data stream mining. The use of confidence intervals to estimate the gain associated with each split leads to very effective methods, like the popular Hoeffding tree algorithm. From a statistical viewpoint, the analysis of decision tree classifiers in a streaming setting requires knowing when enough new information has been collected to justify...
متن کامل